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Abstract 

Let $ be a uniformly distributed random fc-SAT formula with n variables and m clauses. We present 
a polynomial time algorithm that finds a satisfying assignment of <& with high probability for constraint 
densities m/n < (1 — £fc)2 fc ki(fc)/fc, where Ek — > 0. Previously no efficient algorithm was known 
to find solutions with non-vanishing probability beyond m/n = 1.817 • 2 k /k [Frieze and Suen, J. of 
Algorithms 1996]. 

Key words: random structures, efficient algorithms, phase transitions, fc-SAT. 

1 Introduction 

1.1 Solving random fc-SAT 

The fc-SAT problem is well known to be NP-hard for k > 3, and in fact no algorithm with a sub-exponential 
worst-case running time is known to decide whether a given fc-SAT formula has a satisfying assignment. 
Nevertheless, that fc-SAT is NP-hard merely indicates that no algorithm can solve all possible inputs effi- 
ciently. Therefore, there has been a significant amount of research on heuristics for fc-SAT, i.e., algorithms 
that solve "most" inputs efficiently, where the meaning of "most" depends on the scope of the respective 
paper. While some heuristics for fc-SAT are very sophisticated, virtually all of them are based on (at least) 
one of the following basic paradigms. 

Pure literal rule. If a variable x occurs only positively (resp. negatively) in the formula, set it to true (resp. 
false). Simplify the formula by substituting the newly assigned value for x and repeat. 

Unit clause propagation. If the formula contains a clause that contains only a single literal ("unit clause"), 
then set the underlying variable so as to satisfy this clause. Then simplify the formula and repeat. 

Walksat. Initially pick a random assignment. Then repeat the following. While there is an unsatisfied 
clause, pick one at random, pick a variable occurring in the chosen clause randomly, and flip its 
value. 

Backtracking. Assign a variable x, simplify the formula, and recurse. If the recursion fails to find a 
satisfying assignment, assign x the opposite value and recurse. 

Heuristics based on these paradigms can be surprisingly successful (given that fc-SAT is NP-hard) on 
certain types of inputs (e.g., lfT0l[l6l ). However, it remains remarkably simple to generate formulas that 
elude all known algorithms/heuristics. Indeed, the simplest conceivable type of random instances does the 
trick: let 3? denote a fc-SAT formula over the variable set V = {x\, . . . ,x n } that is obtained by choosing 
m clauses uniformly at random and independently from the set of all (2n) k possible clauses. Then for a 
large regime of constraint densities m/n satisfying assignments are known to exist due to non-constructive 
arguments, but no algorithm is known to find one in sub-exponential time with a non-vanishing probability. 
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To be precise, keeping k fixed and letting m = \rn\ for a fixed r > 0, we say that $ has some property 
with high probability ("w.h.p.") if the probability that the property holds tends to one as n — ► oo. Via the 
(highly non-algorithmic) second moment method and the sharp threshold theorem |3]|4][14] it can t> e shown 
that 3? has a satisfying assignment w.h.p. if m/n < (1 — £fc)2 fe In 2. Here £& tends to for large k. On the 
other hand, a first moment argument shows that no satisfying assignment exists w.h.p. if m/n > 2 k In 2. 
In summary, the threshold for 3? being satisfiable is asymptotically 2 k In 2. 

But for densities m/n beyond 0(2 k /k) no algorithm has been known to find a satisfying assignment 
in polynomial time with probability f2(l) - neither on the basis of a rigorous analysis, nor on the basis 
of experimental or other evidence. In fact, many algorithms, including Pure Literal, Unit Clause, and 
DPLL-type algorithms, are known to fail or exhibit an exponential running time beyond 0(2 k /k). There 
is experimental evidence that the same is true of Walksat. Indeed, devising an algorithm to solve random 
formulas with a non-vanishing probability for densities m/n up to 2 k cu(k)/k for any w(k) — * oo has been 
a well-known open problem [3 4ll8l l2H . which the following theorem resolves. 

Theorem 1.1 There are a sequence Ek — > and a polynomial time algorithm Fix such that Fix applied 
to a random formula <fr with m/n < (1 — £fc)2 fc ln(fc) /k outputs a satisfying assignment w.h.p. 

Fix is a combinatorial, local-search type algorithm. It can be implemented to run in time 0(n + m) 3 / 2 . 

The recent paper Q provides evidence that beyond density m/n = 2 k ln(fc) / k the problem of finding 
a satisfying assignment becomes conceptually significantly more difficult (to say the least). To explain this, 
we need to discuss a concept that originates from statistical physics. 

1.2 A digression: replica symmetry breaking 

For the last decade random fc-SAT has been studied by statistical physicists using sophisticated, insightful, 
but mathematically highly non-rigorous techniques from the theory of spin glasses. Their results suggest 
that below the threshold density 2 fc In 2 for the existence of satisfying assignments various other phase 
transitions take place that affect the performance of algorithms. 

To us the most important one is the dynamic replica symmetry breaking (dRSB) transition. Let S(G>) C 
{0, 1} V be the set of all satisfying assignments of the random formula 3>. We turn 5(<&) into a graph by 
considering a, t £ S(3>) adjacent if their Hamming distance equals one. Very roughly speaking, according 
to the dRSB hypothesis there is a density r^sB such that for m/n < trsb the correlations that shape the 
set S(&) are purely local, whereas for densities m/n > trsb long range correlations occur. Furthermore, 
r RSB ~2 k ln(k)/k. 

Confirming and elaborating on this hypothesis, we recently established a good part of the dRSB 
phenomenon rigorously J2J- In particular, we proved that there is a sequence Ek — > such that for 
m/n > (1 + £k)2 k ln(fc)/fc the values that the solutions a £ assign to the variables are mutu- 

ally heavily correlated in the following sense. Let us call a variable x frozen in a satisfying assignment 
a if any satisfying assignment r such that a(x) ^ t(x) is at Hamming distance O(n) from a. Then for 
m/n > (1 + £fc)2 fc \n(k)/k in all but a o( Infraction of all solutions a £ S(&) all but an £fe-fraction of the 
variables are frozen w.h.p., where £fe — ► 0. 

This suggests that on random formulas with density m/n > (1 + £fe)2 fc \n(k)/k local search algorithms 
are unlikely to succeed. For think of the factor graph, whose vertices are the variables and the clauses, and 
where a variable is adjacent to all clauses in which it occurs. Then a local search algorithm assigns a value 
to a variable x on the basis of the values of the variables that have distance 0(1) from x in the factor graph. 
But in the random formula $ with m/n > (1 + £fc)2 fc \n(k)/k assigning one variable x is likely to impose 
constraints on the values that can be assigned to variables at distance SI (In n) from x in the factor graph. 

The above discussion applies to "large" values of k (say, k > 10). In fact, non-rigorous arguments as 
well as experimental evidence j5 | suggest that the picture is quite different and rather more complicated 
for "small" k (say, k = 3, 4, 5). In this case the various phenomena that occur at (or very near) the point 
2 k ln(fc) / k for k > 10 appear to happen at vastly different points in the satisfiable regime. To keep matters 
as simple as possible we focus on "large" k in this paper. In particular, no attempt has been made to derive 
explicit bounds on the numbers in Theorem ll.ll for "small" values of k. 
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Algorithm 


Density m/n < 


Success probability 


Ref., year 


Pure Literal ("PL") 


o(l) as k — ► 00 


w.h.p. 


[19|, 2006 


Walksat, rigorous 
Walksat, non-rigorous 


i • 2 fc /fc 2 

6 ' 

2 fc /fc 


w.h.p. 
w.h.p. 


191 2009 
|221, 2003 


Unit Clause ("UC") 


1 / k-l\ k ~ 2 2 k 

2 \ k-2 ) ' fe 


n(i) 


E), 1990 


Shortest Clause ("SC") 


1 / fe-l\ te_3 fe-1 2 fc 
8 ^ fe-3 J k-2 ' k 


w.h.p. 


0, 1992 


SC+backtracking ("SCB") 


~ 1.817- ^ 


w.h.p. 


[15|, 1996 


BP+decimation ("BPdec") 
(non-rigorous) 


e-2 fc /fc 


w.h.p. 


ED. 2007 



Table 1 : Algorithms for random k-S AT 



1.3 Related work 

Quite a few papers deal with efficient algorithms for random fc-SAT, contributing either rigorous results, 
non-rigorous evidence based on physics arguments, or experimental evidence. Table 03 summarizes the 
part of this work that is most relevant to us. The best rigorous result (prior to this work) is due to Frieze 
and Suen fT31 , who proved that "SCB" succeeds for densities r)k2 /k, where % is increasing to 1.817 
as A: — ► 00. SCB can be considered a (restricted) DPLL-algorithm. More precisely, SCB combines the 
shortest clause rule, which is a generalization of Unit Clause, with (very limited) backtracking. Conversely, 
it is known that DPLL-type algorithms require an exponential running time w.h.p. for densities beyond 

o{2 k /k) CQ. 

Montanari, Ricci-Tersenghi, and Semerjian [21) provide evidence that Belief Propagation guided dec- 
imation may succeed up to density c ■ 2 k /k. This algorithm is based on a very different paradigm than 
the others mentioned in Table Q] The basic idea is to run a message passing algorithm ("Belief Propaga- 
tion") to compute for each variable the marginal probability that this variable takes the value true/false in a 
uniformly random satisfying assignment. Then, the decimation step selects a variable, assigns it the value 
true/false with the corresponding marginal probability, and simplifies the formula. Ideally, repeating this 
procedure will yield a satisfying assignment, provided that Belief Propagation keeps yielding the correct 
marginals. Proving (or disproving) this remains a major open problem. 

Survey Propagation is a modification of Belief Propagation that aims to approximate the marginal 
probabilities induced by a particular (non-uniform) probability distribution on the set of satisfying assign- 
ments J6|. It can be combined with a decimation procedure as well to obtain a heuristic for finding a 
satisfying assignment. There is (non-rigorous) evidence that for most of the satisfiable regime (actually 
m/n < 2 k ln2 — 0(1)) Belief and Survey Propagation are essentially equivalent [201- Hence, there is no 
evidence that Survey Propagation finds satisfying assignments beyond 0(2 k /k) for general k. 

In summary, various algorithms are known/appear to succeed for densities c • 2 k jk, where the constant 
c depends on the particulars of the algorithm. But I am not aware of prior evidence (either rigorous results, 
non-rigorous arguments, or experiments) that some algorithm succeeds for densities m/n = 2 k oj(k)/k 
with u>(k) — * 00. 

The discussion so far concerns the case of general k. In addition, a large number of papers deal with the 
case k = 3. Flaxman lfl3l provides a survey. Currently the best rigorously analyzed algorithm for random 
3-SAT is known to succeed up to m/n = 3.52 ifTTll . This is also the best known lower bound on the 3-SAT 
threshold. The best current upper bound is 4.506 ifTTI . and non-rigorous arguments suggest the threshold 
to be m 4.267 (5). As mentioned in Section [l~2l there is non-rigorous evidence that the structure of the 
set of all satisfying assignment evolves differently in random 3-SAT than in random fc-SAT for "large" k. 
This may be why experiments suggest that Survey Propagation guided decimation for 3-SAT succeeds for 
densities m/n up to 4.2 J6). 

1.4 Techniques and outline 

The algorithms for random fc-SAT from lP7l[8"l [l"5ll all follow a very simple scheme: 
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Initially all variables are unassigned. In each step apply some rule (referring to the previously 
assigned variables/values only) to select a currently unassigned variable. Assign the selected 
variable for good, simplify the formula, and proceed. 

The Unit Clause algorithm is the prototypical example: the underlying rule is to check if there is a clause 
that has k — 1 false literals due previous decisions; if so, the algorithm sets the last unassigned variable 
so as to satisfy the clause. Otherwise the algorithm picks an unassigned variable randomly and assigns it 
a random value. (The algorithm SCB from 1151 deviates from this pattern slightly as it may backtrack to 
revise previous assignments, but this happens at most 0(ln 2 n) times w.h.p.) 

The analysis of such algorithms is based on the "method of deferred decisions". Suppose we apply 
the algorithm to a random formula and condition on the occurrences of the variables assigned in the first 
t steps. Assume that these are precisely the variables xi,...,Xt. Then all the literals whose underlying 
variable is none of xi, . . . , Xt remain stochastically independent and uniformly distributed over set of the 
remaining 2(n — t) literals. This fact makes it possible to either model the execution of the algorithm by 
differential equations 0O, or by a Markov chain lfl5l . Of course, this type of analysis crucially exploits 
the fact that the algorithm (almost) never revises previous decisions. 

Instead of assigning one variable at a time, the Walksat algorithm starts from a complete (e.g., randomly 
chosen) assignment of truth values to all the variables. Of course, this initial assignment is very unlikely to 
be satisfying. Hence, while there is an unsatisfied clause, the algorithm picks one of them at random and 
flips the value of a randomly chosen variable occurring in that clause. Since Walksat actually starts from 
a complete assignment and may flip the value of the same variable several times, the method of deferred 
decisions does not apply. In fact, although experimental (and non-rigorous) evidence suggests that Walksat 
finds a satisfying assignment in linear time w.h.p. for m/n < 2 k /k, the best current rigorous analysis only 
shows this for m/n < 2 k / (6k 2 ) H. (The proof is based on relating Walksat to a branching process.) 

The algorithm Fix for Theorem ll.ll is similar to Walksat in that it starts with a complete assignment - 
say, for the sake of concreteness, the one that sets all variables to true. The number of unsatisfied clauses is 
(1 + o(l))2~ k m w.h.p. To reach a satisfying assignment, Fix will have to flip (at least) one variable from 
each of these clauses. But in contrast to Walksat, Fix does not choose this variable randomly. Instead 
Fix applies a greedy rule: whenever possible choose a variable x so that flipping x does not generate 
new unsatisfied clauses. Thus, one could consider Fix a greedy version of Walksat. We will describe the 
algorithm precisely in Section[3] 

The analysis of Fix is based on a blend of probabilistic methods (e.g., martingales) and combinatorial 
arguments. We can employ the method of deferred decisions to a certain extent: the analysis "pretends" that 
the algorithm exposes the literal occurrences of the random input formula only when it becomes strictly 
necessary, so that the unexposed ones remain "random". However, the picture is not as clean as in the 
analysis of, say, Unit Clause. The reason is that we will have to track certain rather non-trivial random 
variables throughout the process, for which we will resort to a direct combinatorial analysis. Section [3] 
contains an outline of the analysis, the details of which are carried out in Section[4H6] Before we come to 
this, we need a few preliminaries. 

2 Preliminaries and notation 

In this section we introduce some notation and present a few basic facts. Although most of them (or closely 
related ones) are well known, we present some of the proofs for the sake of completeness. 

2.1 Balls and bins 

Consider a balls and bins experiment where \i distinguishable balls are thrown independently and uniformly 
at random into n bins. Thus, the probability of each distribution of balls into bins equals n _At . 

Lemma 2.1 Let Z(u, n) be the number of empty bins. Let A = ncxp(—u/n). Then P [2(fi, n) < A/2] < 
0(JJC) • exp(— A/8) as n — > oo. 
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The proof is based on the following Che rnoff bound on the tails of a binomially distributed random variable 
X with mean A (see fl8l pages 26-28]): for any t > 

P(X>X + t)<cxp (- —L— ^j and P(X < A - t) < exp (-L) . (1) 

Proof of Lemma |Z7] Let Xi be the number of balls in bin i. In addition, let (Y)i<i<n be a family of 
mutually independent Poisson variables with mean p/n, and let Y = Y^=i Then Y has a Poisson 
distribution with mean p. Therefore, Stirling's formula shows P [Y = /x] = 6(/i~ 1,/2 ). Furthermore, 
the conditional joint distribution of Yi, . . . , Y„ given that Y = p coincides with the joint distribution of 
Xi, . . . , X n (see, e.g., lfT2l Section 2.6]). As a consequence, 

P[Z(p,n)<X/2] = P [\{i G [n] : Yi = 0}| < A/2|Y = p] 

s P[H,cN:^0)|<A/2] _ p e w : ^ _ 0}| < A/2] (2> 

Finally, since Y\ , . . . , Y n are mutually independent and P [Y, =0] = A/n for all 1 < z < n, the number of 
indices i £ [n] such that Y = is binomially distributed with mean A. Thus, the assertion follows from (j2]i 
and the Chernoff bound ([TJi. □ 



2.2 Random &>SAT formulas 

Throughout the paper we let V = V n = {xi, . . . , x n } be a set of propositional variables. If Z C V, then 
Z = {x : x € Z} contains the corresponding set of negative literals. Moreover, if / is a literal, then |/| 
signifies the underlying propositional variable. If [i is an integer, let \p\ ={1,2,..., /j,}. 

We let f2fc(n, m) be the set of all fc-SAT formulas with variables from V = {xi, . . . , x n } that contain 
precisely m clauses. More precisely, we consider the formula an ordered m-tuple of clauses and each 
clause an ordered fc-tuples of literals, allowing both literals to occur repeatedly in one clause and clauses 
to occur repeatedly in the formula. Let Sfc(n, m) be the power set of f2fe(n, m), and let P = Pfe(n, m) 
be the uniform probability measure (which assigns probability (2n)~ krn to each formula). We obtain a 
probability space (f2fe(n, to), Efc(n, to), P). 

Throughout the paper we denote a random element of il^ (n, to) by <&. Unless otherwise specified, $ 
is uniformly distributed. In addition, we use $ to denote specific (i.e., non-random) elements of fi/j(n, m). 
If $ £ Oft(n, to), then $i denotes the ith clause of $, and $y denotes the jth literal of $j. 

Lemma 2.2 For any <5 > and any k > 3 fnere w no > 5mc/i that for all n > rio the following is 
true. Suppose that m > Sn and that Xi : Ofe(n, to) — > {0, 1} is a random variable for each i £ [to]. Let 
jjL = [in 2 n] . If there is a number \ > 5 such that for any set M C [to] of size p we have 



E 



Lie A/ 



< A M , then P 



> (1 + <5)Ar 



< n 



Proof. Let be the number of sets M C [to] of size p such that JlieA/ = ^ Then E < 
If X = 2™ ! Xi>L= + S)Xm] , then > ( L ) . Consequently, by Markov's inequality 



P [X > L] < P 



M > 



L 



< E [M] < 



)A^ 



< 



Ato 
L — p 



< 



Ato 



(1 + <5)Ato — /i 



Since A?n > S 2 n we see that (1 + S)Xm — p > (1 + 6/2)Xm for sufficiently large n. Hence, P [X > L] < 



(1 + 5/2)-" < 



-10 



for large enough n. 



□ 



Although we allow variables to appear repeatedly in the same clause, the following lemma shows that 
this occurs very rarely w.h.p. 
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Lemma 2.3 Suppose that m = 0(n). Then w.h.p. there are at most Inn indices i € [to] such that one of 
the following is true. 

1. There are 1 < j\ < ji < k such that \ = \<&ij 2 1- 

2. There is i' ^ i and indices j\ ^ j% j[ ^ j' 2 such that \ = \ and \&ij 2 \ = \- 
Furthermore, w.h.p. no variable occurs in more than In 2 n clauses. 

Proof. Let X be the number of such indices i for which 1. holds. For each i G [m] and any pair 1 < ji < 
]2 < k the probability that \ = \*&ij 2 1 is because each of the two variables is chosen uniformly 
at random. Hence, by the union bound the probability that there are ji, such that \ = \&ij 2 1 is at 
most (2) /n. Consequently, E [X] < m{^j/n = O(l), and thus X < i Inn w.h.p. by Markov's inequality. 

Let Y be the number of i G [?n] for which 2. is true. For any given i, i' , ji, j[, J2> J2 the probability 
that 1 = I and \ &ij 2 \ = \$vj' | is 1/n 2 . Furthermore, there are m 2 ways to choose i, i' and then 
(k(k — l)) 2 ways to choose ji,j[,j2,j<2- Hence, E [Y] < m 2 k 4 n~ 2 = O(l). Thus, X < ^ In n w.h.p. by 
Markov's inequality. 

Finally, for any variable x the number of indices i € [to] such that x occurs in 4?^ has a binomial 
distribution Bin(m, 1 — (1 — l/n) k ). Since the mean to- (1 — (1— l/n) k ) is 0(1), the Chernoff bound ([TJ 
implies that the probability that x occurs in more than In 2 n clauses is o(l/n). Hence, by the union bound 
there is no variable with this property w.h.p. □ 

Recall that a filtration is a sequence (!Ft)o<t<T of cr-algebras Tt C Sfc(n, m) such that Tt C Tt+\ for 
all < t < t. For a random variable X we let E [X|jF t ] denote the conditional expectation (which is a 
random variable). Remember that P [-{Tt] assigns a probability measure P [-\Tt\ ($) to any $ G Qk(n, to), 
namely 

P [\T t ] ($) : A G E fc (n, m) ^ E [l A |^ t ] ($), 
where l A (y) = US (f € A and 1a(v) = otherwise. 

Lemma 2.4 Let {Tt)o<t<T a filtration and let (X t )i<t< T be a sequence of random variables such that 
each X t is Tt-measurable. Assume that there are numbers £ t > such that E [Xt|^t_i] < £t for all t. 
ThenElil^^XtlTo] < Ui<t<r^ 

Proof. For 1 < s < r we let Y s = Yit=i ^t- Let s > 1. Since F s _i is T s -i -measurable, we obtain 

E [F S |JT ] = E [Y S ^X S \T Q ] = E [E [Y S ^X S \T S ^] \T Q ] = E [Y S ^E [X S \T S ^] \T ] < £ S E [F s _i|^ ] , 

whence the assertion follows by induction. □ We also need the following tail bound ("Azuma-Hoeffding", 
e.g. HSp. 37]). 

Lemma 2.5 Let (M t )o<t< T be a martingale such that Mq = E [M T ]. Suppose that \ AI t — M f _i| < c t for 
alll<t<T. Thenfoiany A > P [\M T - M | > A] < exp [-A 2 /(2 Y7 t=1 c 2 )] . 

Finally, we need the following bound on the number of clauses that have "few" positive literals in total 
but contain at least one positive variable from a "small" set. 

Lemma 2.6 There is a constant a > such that for all k > 3 and m/n < 2 fc fc _1 lnfc the following 
is true. Let 1 < I < s/k and set S = ak~ Al . For a set Z C V let Xz be the number of indices 
i G [m] such that €>i is a clause with precisely I positive literals that contains a variable from Z. Then 
max {A^ : \Z\ < Sn} < y/5nw.h.p. 

Proof. Let p = \\ZSn\ . We use a first moment argument. Clearly we just need to consider sets Z of size 
[<5nJ . Thus, there are at most (J^) ways to choose Z. Once Z is fixed, there are at most ( m ) ways to 
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choose a set T C [m] of size fi. For each iel the probability that contains a variable from Z and has 
precisely I positive literals is at most 2 1 ~ k k( k l )S Hence, by the union bound 



P [max{X z : \Z\ < 5n} > fj] < 



< 



< 



< 



; r 

on J \ « 



2 1 ~ k k 



< 



c y« / 2cln(fc)(;)5n X 



/' 



e\<5" / 2efcm(j)5\ 
1, 2^ ) 

[as to < 2 fe fc~ 1 lnfc] 



e\ Sr 
5J 

6. 



Ueln(k) -k l -VS 



exp 



[because /j = ] 
<W 8 [as 6 = ak~ 41 for a small a > 0] 

[ V5(l -hiS) + -hx5 



The last expression is o(l), because v^(l — In S) + i In S is negative for sufficiently small 6. 



□ 



3 The algorithm Fix 

In this section we present the algorithm Fix. To establish Theorem 1 1.1 1 we will prove the following: for 
any < e < 0.1 there is kg = fco(e) > 3 such that for all k > ko the algorithm Fix outputs a satisfying 
assignment w.h.p. when applied to $ with to = [(1 — e)2 k k^ 1 lnfcj. Thus, we assume that k exceeds 
some large enough number fco depending on e only. In addition, we assume throughout that n > uq for 
some large enough no = no(e, k). We set 

uj = (1 - e) In k and k x = \k/2\ . 

Let $ € £lk(n, m) be a fc-SAT instance. When applied to $ the algorithm basically tries to "fix" the 
all-true assignment by setting "a few" variables Z C V to false so as to satisfy all clauses. Obviously, 
the set Z will have to contain one variable from each clause consisting of negative literals only. The key 
issue is to pick "the right" variables. To this end, the algorithm goes over the all-negative clauses in the 
natural order. If the present all-negative clause does not contain a variable from Z yet, Fix (tries to) 
identify a "safe" variable in which it then adds to Z. Here "safe" means that setting the variable to false 
does not create new unsatisfied clauses. More precisely, we say that a clause $^ is Z-unique if $i contains 
exactly one positive literal from V\Z and no negative literal whose underlying variable is in Z. Moreover, 
x 6 V \ Z is Z -unsafe if it occurs positively in a Z-unique clause, and Z-safe if this is not the case. Then 
in order to fix an all-negative clause <1>; we prefer Z-safe variables. 

To implement this idea, Fix proceeds in three phases. Phase 1 performs the operation described in the 
previous paragraph: try to identify a Z-safe variable in each all-negative clause. Of course, it will happen 
that an all-negative clause does not contain a Z-safe variable. In this case Fix just picks the variable 
in position ki. Consequently, the assignment constructed in the first phase will not satisfy all clauses. 
However, we will prove that the number of unsatisfied clauses is very small, and the purpose of Phases 2 
and 3 is to deal with them. Before we come to this, let us describe Phase 1 precisely. 



Algorithm 3.1 Fix($) 

Input: A fc-SAT formula $. Output: Either a satisfying assignment or "fail". 

1a. LetZ = 0. 

1 b. For i = 1, . . . , m do 

1 c. If $i is all-negative and contains no variable from Z 

1 d. If there is 1 < j < fci such that | is Z-safe, then pick the least such j and add | to Z. 

1 e. Otherwise add I*; fcl | to Z. 
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The following proposition, which we will prove in SectionHl summarizes the analysis of Phase 1. Let az 
be the assignment that sets all variables in V \ Z to true and all variables in Z to false. 

Proposition 3.2 At the end of the first phase of Fix(<&) the following statements are true w.h.p. 

1. We have \Z\ < ink' 1 lnu>. 

2. At most (1 + e/3)wn clauses are Z -unique. 

3. At most exp(— fc £ / 8 )n clauses are unsatisfied under az- 

Since the probability that a random clause is all-negative is 2~ fc , under the all-true assignment (1 + 
o(l))2~ fc m ~ un/k clauses are unsatisfied w.h.p. Hence, the outcome az of Phase 1 is already a lot 
better than the all-true assignment w.h.p. 

Step Id only considers indices 1 < j < kx. This is just for technical reasons, namely to maintain a 
certain degree of stochastic independence to facilitate (the analysis of) Phase 2. 

Phase 2 deals with the clauses that are unsatisfied under az- The general plan is similar to Phase 1: we 
(try to) identify a set Z' of "safe" variables that can be used to satisfy the cr^-unsatisfied clauses without 
"endangering" further clauses. More precisely, we say that a clause $i is (Z, Z')-endangered if there is no 
1 < j < k such that the literal $jj is true under az and|$y| G V\ Z' . In words, $^ is (Z, Z')-endangered 
if it relies on one of the variables in Z' to be satisfied. Call $^ (Z, Z')-secure if it is not (Z, Z')-endangered. 
Phase 2 will construct a set Z' such that for all 1 < i < m one of the following is true: 

• $ t is (Z, Z')-secure. 

• There are at least three indices 1 < j < k such that G Z' . 

To achieve this, we say that a variable a; is (Z, Z')-unsafe if x G ZUZ' or there are indices G [m] x [k] 
such that the following two conditions hold: 

a. For all j ^ I we have $ y eZUZ'U V\Z. 

b. <5> l i = x. 

(In words, x occurs positively in and all other literals of $^ are either positive but in Z U Z 1 or negative 
but not in Z.) Otherwise we call x (Z, Z')-safe. In the course of the process, Fix greedily tries to add as 
few (Z, Z')-unsafe variables to Z' as possible. 

2a. Let Q consist of all i e [m] such that $ 4 is unsatisfied under az- Let Z' = 0. 

2b. While Q ^ 

2c. Leti = minQ. 

2d. If there are indices fei < ji < j 2 < 33 < k - 5 such that 1$^, | is (Z, Z')-safe for 1 = 1,2, 3, 

pick the lexicographically first such sequence and add \, \$ij 2 \, |$ij 3 | to Z'. 
2e. else 

let k - 5 < ji < j 2 < ]3 < k be the lexicographically first sequence such that 1$^ | Z' 
and add to Z' (I = 1,2,3). 
2f. Let Q be the set of all (Z, Z')-endangered clauses that contain less than 3 variables from Z'. 

Note that the While-loop gets executed at most n/3 times, because Z' gains three new elements in each 
iteration. Actually we prove in Section|5]below that the final set Z' is fairly small w.h.p. 

Proposition 3.3 The set Z' obtained in Phase 2 of Fix(<&) has size \Z'\ < nk~ 12 w.h.p. 

After completing Phase 2, Fix is going to set the variables in V \ (Z U Z') to true and the variables 
in Z \ Z' to false. This will satisfy all (Z, Z')-secure clauses. In order to satisfy the (Z, Z') -endangered 
clauses as well, Fix needs to set the variables in Z' appropriately. Since each (Z, Z') -endangered clauses 
contains three variables from Z', this is essentially equivalent to solving a 3-SAT problem, in which Z' 
is the set of variables. As we shall see, w.h.p. the resulting 3-SAT instance is sufficiently sparse for the 
following "matching heuristic" to succeed: set up a bipartite graph G , ($, Z, Z') whose vertex set consists of 
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the (Z, Z') -endangered clauses and the set Z' . Each (Z, Z') -endangered clause is adjacent to the variables 
from Z' that occur in it. If there is a matching M in G($, Z, Z') that covers all (Z, Z')-endangered clauses, 
we construct an assignment crz.z'.M as follows: for each variable x s V we define 

( false if x € Z \ Z' 
oz.z 1 ,m (%) = < false if {<I>i, x} 6 M for some 1 < i < m and x occurs negatively in <I>i, 
[ true otherwise. 

To be precise, Phase 3 proceeds as follows. 

3. If G(<£>, Z, Z') has a matching that covers all (Z, Z') -endangered clauses, then compute an (arbitrary) 
such matching M and output <tz,z',m- If not, output "fail". 

The (bipartite) matching computation can be performed in 0((n + m) 3 / 2 ) time via the Hopcroft-Karp 
algorithm. In Section|6]we will show that the matching exists w.h.p. 

Proposition 3.4 W.h.p. G($, Z, Z') has a matching that covers all (Z, Z')-endangered clauses. 

Proof of Theorem 17.71 Fix is clearly a deterministic polynomial time algorithm. It remains to show 
that Fix(<&) outputs a satisfying assignment w.h.p. By Proposition 13 .41 Phase 3 will find a matching M 
that covers all (Z, Z')-endangered clauses w.h.p., and thus the output will be the assignment a = az.z 1 .m 
w.h.p. Assume that this is the case. Then a sets all variables in Z\Z' to false and all variables in V\ (ZUZ') 
to true, thereby satisfying all (Z, Z')-secure clauses. Furthermore, for each (Z, Z') -endangered clause 
there is an edge {<&.;, in M. If $>ij is negative, then o"(|$ij|) = false, and if if <&jj is positive, then 

a(<&ij) = true. In either case a satisfies 3?^. □ 



4 Proof of Proposition 1X2 



Throughout this section we let < e < 0.1 and assume that k > ko for a sufficiently large fco = fco(e). 
Moreover, we assume that m = [(1 — e)2 k k~ 1 In fcj and that n > no for some large enough no = no(e, k). 
Let w = (1 — e) In k and ki = \k/2] . 



4.1 Outline 

Before we proceed to the analysis, it is worthwhile giving a brief intuitive explanation as to why Phase 1 
"works". Namely, let us just consider the^rsf all-negative clause of the random input formula. Without 
loss of generality we may assume that i = 1. Given that 3>i is all-negative, the fc-tuple of variables 
(\&ij\)l<j<k € V k is uniformly distributed. Furthermore, at this point Z = 0. Hence, a variable x is 
Z-unsafe iff it occurs as the unique positive literal in some clause. The expected number of clauses with 
exactly one positive literal is k2~ k m ~ ton. Thus, for each variable x the expected number of clauses in 
which x is the only positive literal is k2~ k m/n ~ ui. In fact, for each variable the number of such clauses 
is asymptotically Poisson. Consequently, the probability that x is not Z-supporting is (1 + o(l)) exp(— uj). 
Returning to the clause $i, we conclude that the expected number of indices 1 < j < k\ such that 
is Z-safe is (1 + o(l))fci exp(— uj). Since uj = (1 — e) lnfc, we have 

(1 + o(l))fci exp(-w) > k E /3. 

Indeed, the number of indices 1 < j < ki so that is Z-safe is binomially distributed, and hence the 
probability that there is no Z-safe |$xj I is at most (1 + o(l)) exp(— fc e /3). Thus, it is "quite likely" that $i 
can be satisfied by setting some variable to false without creating any new unsatisfied clauses. Of course, 
this argument only applies to the first all-negative clause (i.e., Z = 0), and the challenge lies in dealing 
with the stochastic dependencies that arise in the course of the execution. 

To this end, we need to investigate how the set Z computed in Steps 1 evolves over time. Thus, we 
will analyze the execution of Phase 1 as a stochastic process, in which the set Z corresponds to a sequence 
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(Zt)t>o of sets. The time parameter t is the number of all-negative clauses for which either Step Id or le 
has been executed. We will represent the execution of Phase 1 on input <I> by a sequence of (random) maps 

n t : [m] x [k] -> {-1, 1}UVUV. 

The map 7r t is meant to capture the information that has determined the first t steps of the process. If 
n t(hj) = 1 (resp. TT t (i,j) = — 1), then Fix has only taken into account that <J?,;j is a positive (negative) 
literal, but not what the underlying variable is. If Tr t (i, j) € V U V, then Fix has revealed the actual literal 

Let us define the sequence 7r t precisely. Let Zq = 0. Moreover, let Uq be the set of all i such that 
there is exactly one j such that is positive. Further, define ito(i,j) for (i, j) G [to] x [fc] as follows. If 
i € Uq and <&jj is positive, then let no(i,j) = 3>ij. Otherwise, let iro(i,j) be 1 if $y is a positive literal 
and —1 if 3?y is a negative literal. In addition, for x E V let 

U (x) = \{i G ?7o : 3j G [fc] : 7r (i, j) = x}| 

be the number of clauses in which x is the unique positive literal. For t > 1 we define 7r t as follows. 

PI1 If there is no index i G [m] such that is all-negative but contains no variable from Z t -\, the 

process stops. Otherwise let <p t be the smallest such index. 
PI2 If there is 1 < j < k\ such that Ut-i{\$ip t j\) = 0, then choose the smallest such index; otherwise 

let j = ki. Let z t = $^ tJ - 4 and Z t = Z t -\ U {z t }. 
PI3 Let Ut be the set of all i G [m] such that is Z t -unique. For x G y let f/t(x) be the number of 

indices i G Ut such that x occurs positively in & L . 
PI4 For any G [to] x [k] let 

/ • -\ = f *ij if (* = <£t A J < fci) V | G Z t V (z G C/ 4 A n (i, ]) = 1), 
1Tt[ - %,:] ' | 7Tf_i(i,j) otherwise. 



Let T be the total number of iterations of this process before it stops and define 7r t = ttt, Z t = Zt, 
U t = U T , U t {x) = U T {x), (f> t = z t = for all t > T. 

Let us discuss briefly how the above process mirrors Phase 1 of Fix. Step PI1 selects the least index 
(j>t such that clause <&^ 4 is all-negative but contains no variable from the Zt-x of variables that have been 
selected to be set to false so far. In terms of the description of Fix, this corresponds to jumping forward to 
the next execution of Steps ld-e. Since Ut-\{x) is the number of Z t _i-unique clauses in which variable x 
occurs positively, Step PI2 applies the same rule as ld-e of Fix to select the new element z t to be included 
in the set Z t . Step PI3 then "updates" the numbers Ut(x). Finally, step PI4 sets up the map ir t to represent 
the information that has guided the process so far: we reveal the first k% literals of the current clause $^ t , 
all occurrences of the variable z t , and all positive literals of Z t -unique clauses. 

Observe that at each time t < T the process PI1-PI4 adds precisely one variable z t to Z t . Thus, 
\Z t \ = t for any t < T. Furthermore, for 1 < t < T the map 7r t is obtained from iit-i by replacing some 
±ls by literals, but no changes of the opposite type are made. Finally, for any i G [to] there is either no j 
such that TTt(i, j) = 1, or there are at least two such indices j. This is because step PI4 ensures that for any 
i such that is Z t -unique irt(i,j) equals the literal 3?^ if it is positive. 

Of course, the process PI1-PI4 can be applied to any concrete fc-SAT formula $ (rather than the 
random €>). It then yields a sequence ir t [$] of maps, variables z t [$], etc. 

For each integer t > we define an equivalence relation = t on the set f2fc(n, to) of fc-SAT formulas by 
letting $ = t ^> iff ir s [$] = ir s [$>] for all < s < t. Let Tt be the cr-algebra generated by the equivalence 
classes of = t . The family (Tt)t>o is a filtration, and the following is immediate from the construction. 

Fact 4.1 For any t > the random map 7r t , the random variables 4>t+i and z t , the random sets Ut and Z t , 
and the random variables Ut (x) for x G V are Tt-measurable. 

Intuitively, that a random variable X is Tt -measurable means that its value is determined by time t. The 
following is the key fact about the sequence Tr t (i,j). 



4 PROOF OF PROPOSITION ?? 



11 



Proposition 4.2 Let £ t be the set of all pairs such that Tr t (i,j) G {— 1, 1}. The conditional joint 
distribution of the variables (\&ij\)(i.j)e£ t gi ven Ft is uniform over (V \ Z t ) £t . That is, for any formula 
$ and for any map f from £ t [$] to V \ Z t [$] we have 

p iy(i,i) g s t m : = m = \v\z t [*] 

Proof. Let [$] ( signify the = t -equivalence class of $. Let P$ denote the conditional probability distribution 
P [• \Tt] ($)■ Then for any event X we have 

p $ [x] = p[x|[$] t ] = |[$] t nx|/|[$] t |. (3) 

That is, the conditional distribution P$ is uniform over [$] t . Hence, we just need to determine |[$L|. 
Given a map / : £ t [$] — > F \ Z t [$], we define a formula <£>/ by letting 



if {i,j) G f t [$] and7r (i,j) = -1, 

(*/)« = { f(i,j) if M 6 £ [*] and 7r (*,i) - 1, (* € H , j G [*]). 

otherwise 

Then <&/ = t <I>. Hence, we obtain a bijection (1/ \ Z t — > [$] t , / i-> $/, and thus the assertion 
follows from (O. □ 

In each step of the process PI1-PI4 one variable z t is added to Z t . There is a chance that this variable 
occurs in several other all-negative clauses, and therefore the stopping time T should be smaller than the 
total number of all-negative clauses. To prove this, we need the following lemma. 

Lemma 4.3 W.h.p. the following is true for all 1 < t < min{T, n}: the number of indices i G [m] such 
that 7r t (i, j) = —1 for all 1 < j < k is at most 2nujexp(—kt/n)/k. 

Proof. We consider the random variables 

*r f 1 if 7Tt(«, j) — — 1 and t < T, ,. , . . ,,, , _ ... 

M« = ( other l wi ^ (^M^W^o). 

Let t <n, p = [In 2 n], and let 2" C [771] be a set of size p. Let = 1 if t < T and ir t (i,j) = —1 for all 
j G [fc], and let 1^ = otherwise. Set j" = [t] X J X [fc] . If Yi = 1 for all i G J, then Af oij = 1 for all 
G I x [fc] and M S ij = 1 for all (s,i,j) G J7, and we will prove below that 



E 



n Mo*- 11 

(ij)exx[fc] (t,i,j)ej 



< 2-^1(1 -l/n)^!. (4) 



Hence, 



E 



< [2- k (l-l/n) kt Y <\^, whereA = 2- A; exp(-/l-t/?i). (5) 



Combining the bound © with Lemma [2721 we see that with probability at least 1 — n -10 there are no 
more than 2Am indices i G [m] such that 7r t (i, j) = —1 for all j G [fc]. Hence, by the union bound the 
probability that this holds for all t < min{T, n} is at least 1 — n~~ 9 . As 2Am < 2nujexp(—kt/n)/k, this 
implies the assertion. 

To complete the proof, we need to establish (0]). Let 

^0 = -Mhj. # - {(hj) ■ (t,i,j) G J}, andX t = J] My- 

(i,i)eix[fc] (»,i)6Jt 

Since the signs of the literals <&y are mutually independent, we have 

Epf ] = 2- fc l x l. (6) 
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Furthermore, we claim that 

E[* t |^i_i] < (l-l/n)^; (7) 

then (0]i follows by plugging (0 and © into Lemma l2~4l 

To prove 0, let t > 1. If T < t or 7r t _i(«, j) 7^ -1 for some (£, j) € J t , then clearly X t = Af t ij = 0. 
Hence, suppose that T > t and ir t -i(i,j) = — 1 for all (i, j) € J7t. Then at time t PI2 selects some 
variable Zt G V \ Zt-\, and J\fuj = 1 only if |«l>y| 7^ z t . As irt-i(i,j) = — 1 for all e Jt, given 
JFt-i the variables : \)(i.j)ej t are mutually independent and uniformly distributed over V \ Z t -\ by 
Proposition l4.2l Therefore, for each G J7t independently we have |3>y | = z t with probability at least 
1 jn, whence (O follows. □ 

Corollary 4.4 W/i.p. we have T < Ank^ 1 lnw. 

Proof. Let to = 2nfc _1 lnw and let 7 f be the number of indices i such that TT t (i,j) = —1 for all 1 < j < fc. 
Then PI2 ensures that 7 t < 7 t _ x - 1 for all f < T. Consequently, if T > 2t , then < I T < It ~ to, and 
thus I to > to. Since 2nk~ 1 hiuj > 3nujexp(~kto/Ti)/k for sufficiently large k, Lemma [431 entails that 

P [T > 2to] < P [1^ > *o] = P [it > S"^" 1 lnw ] < p [ito > 3nwexp(-fct /n-)/fc] = o(l). 
Hence, T < 2£o w.h.p. □ For the rest of this section we let 

6 = L4nfc- 1 lncjJ. 

The next goal is to estimate the number of Z t -unique clauses, i.e., the size of the set XJ t . For technical 
reasons we will consider a slightly bigger set: let U t be the set of all i £ [m] such that there is an index j 
such that 7To(i,i) 7^ —1 but there exists no j such that 7r t (i, j) E {1} U Z t . That is, clause <&, contains a 
positive literal, but by time t there is at most one positive literal <I>jj left that does not belong to Z t , and &i 
has no negative literal whose underlying variable lies in Z t . In Section l4~2l we will establish the following 
bound. 

Lemma 4.5 W.h.p. we have maxo<t<T |Wt| < (1 + e/3)um. 

Let us think of the variables x E V \ Z t as "bins" and of the clauses with i £ Ut as "balls". If we 
place each ball i into the (unique) bin x such that x occurs positively in then by Lemma l4~5l the average 
number of balls in a bin is at most 

(l + e/3)wn (l + e/3)w 

TW" = w - h - p - 

As w < (1 — e)lnfc and t < T < 4r7,fc _1 lnw w.h.p. by Corollary 14.41 for large enough k we have 
(1 + e/3)(l — t/n)~ 1 ui < (1 — 0.6e)lnfc. Hence, if the "balls" were uniformly distributed over the "bins", 
we would expect 

\V \ Z t \ exp(-|C7 t |/|y \Z t \) > (n- t)k Q - 6s ~ l > nk"' 2 - 1 



"bins" to be empty. The next corollary shows that this is actually true. We defer the proof to Section l43l 

Corollary 4.6 Let Q t = \{x G V \ Z t : U t (x) = 0}|. Then min t < T Q t > nk 6 / 2 ' 1 w.h.p. 

Now that we know that there are "a lot" of variables x G V \ Z t -\ such that Ut(x) = w.h.p., we can 
prove that it is quite likely that clause contains one. More precisely, we have the following. 

Corollary 4.7 Let 

B = f 1 ?fmini< j</ci Ut-i(\®<i, tj \) > 0, Qt-x > nk'/ 2 -\ \U t \ < (1 + e/3)iun, andT > t, 
\ otherwise. 

Then Bt is Tt-measurable and E [Bt\ Ft-i] < ex p(~ k e ^ 6 ) for all 1 < t < 9. 
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Proof. Since the event T < t and the random variable Qt-i are Tt—\ -measurable and as £7t_i(|<I>^ t j|) is 
Tt -measurable for any j < k\ by Fact l4. 11 Bt is ^-measurable. Let $ be such that T [$] > t, Qt-i > 
nfc a_1 , and \Ut-i [$] | < (1 + s/3)um. We condition on the event <1> =t-i Then at time t the process 
PI1-PI4 selects <\> t such that ir t -i(4>t, j) — — 1 for all j € [fc]. Hence, by Proposition 14.21 the variables 
|$0 t j | are uniformly distributed and mutually independent elements of V \ Z t -\. Consequently, for each 
j < k\ the event t/ t _i(|€>0 t j|) = occurs with probability | 1 1 / 1 ^ \ Zt-i\ > fc 6 / 2 " 1 independently. 
Thus, the probability that U t -i{\&<t, t j\) > Ofora/Z j < ki is at most (1 - fee/2-i)fci-i < eX p(-fe £ /6) n 

Proof of Proposition \3.2\ The definition of the process PI1-PI4 mirrors the execution of the algorithm, i.e., 
the set Z obtained after Steps la-Id of Fix equals the set Zt- Therefore, the first item of Proposition l3.2l 
is an immediate consequence of Corollary 14.41 and the fact that \Z t \ = t for all t < T. Furthermore, the 
second assertion follows directly from Lemma 1431 

To prove the third claim, we need to bound the number of clauses that are unsatisfied under the as- 
signment az T that sets all variables in V \ Zt to true and all variables in Zt to false. By construction 
any all-negative clause contains a variable from Zt and is thus satisfied under o~z T ■ We claim that for any 
i G [m] such that 4?^ is unsatisfied under oz T one of the following is true. 

a. There is t < T such that i G Ut-i and z t occurs positively in 4?^. 

b. There are 1 < j\ < j% < k such that = &ij 2 - 

To see this, assume that b. does not occur. Let us assume without loss of generality that . . . , &u are 
positive and 4>jj+i, . . . , &ik are negative for some I > 1. Since 3>i is unsatisfied under oz T , we have 

. . . , G Zt- Hence, for each 1 < j < I there is tj < T such that <&ij — z tj . As <&ji, . . . , 
are distinct, the indices t\, . . . , t; are mutually distinct, too. Assume that t\ < • • • < t\, and let to = 0. 
Then contains precisely one positive literal from V \ Z tl _ 1 . Hence, i G Ut l _ 1 . Since <I>i is unsatisfied 
under az T no variable from Zt occurs negatively in <frj and thus i G U s for all t;_i < s < t%. Therefore, 
i G Uti~\ and z tl = <f>a, i.e., a. occurs. 

Let X be the number of indices i G [m] such that a. occurs. We claim that 

X < ncxp{-k e/7 ) w.h.p. (8) 

Since the number of i G [to] for which b. occurs is 0(lnn) w.h.p. by Lemma l2~3l (H]l implies the third 
assertion. 

To establish ©, let Bt be as in Corollary 14. 7l and set 

v = f Ut-i(z t ) if B t = 1 and U t -i{z t ) < In 2 n, 
1 otherwise. 

Then by the definition of the random variables Bt, T) t either 

X < ]T V t (9) 

l<t<0 

or one of the following events occurs: 

i. T > 9. 

ii. Q t < nk 6 ' 2 - 1 for some < t < T. 

iii. \U t \ > (1 +e/3)ujn for some 1 < t < T. 

iv. \Ut-i(z t )\ > In 2 n for some 1 < t < 6. 

The probability of i. is o(l) by Corollary 14.41 Moreover, ii. does not occur w.h.p. by Corollary 14.61 and the 
probability of iii. is o(l) by Lemma 1431 If iv. occurs, then the variable z t occurs in at least In 2 n clauses 
for some 1 < t < 6, which has probability o(l) by Lemma [2~3l Hence, (O is true w.h.p. 

Thus, we need to bound J2i<t<e ^" ne ran dom variable T) t is ^-measurable and T) t — for all 
t > 6. Let V t = *E\D t \Tt-i] and M t = J2l=i v s - V s . Then (M t )i<t<e is a martingale. As all 
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increments V s — V s are less than In 2 n in absolute value by the definition of T> t , Lemma [23] (Azuma- 
Hoeffding) entails that Aig = o(n) w.h.p. Hence, w.h.p. we have 

Y, -D t = o(n)+ Y V t . (10) 
i<t<e i<t<e 

We claim that 

V t < 2wexp(-fc e/6 ) for all 1 < t < 6. (11) 

For by Corollary 14. 71 we have 

E[B t |^t-i] <cxp(-fc £ / 6 ). (12) 

Moreover, given Tt-\ we have TTt-i {4>t, ki) = — L whence Zt is uniformly distributed over V \ Z t -i by 
Proposition [4j2] Since Bt = 1 implies |Z7t-i| < (1 + e/3)ujn, this means that the conditional expectation 
of Ut-i(zt) is at most 

\U^\/W\Z^\< {1+£/ f n <2u J . (13) 
n — t 

Combining ( [T2l and ( [T3l , we obtain (fTTb . Further, plugging dTTb into ( TTOb . we get 

Y v t = 2u;cxp(~k £/2 /3)6 + o(n) < 3w cxp(-fc e/6 )(9 < ncxp(-/c e/7 ) w.h.p. 
i<t<e 

Thus, © follows from ©. □ 

4.2 Proof of Lemma [431 

For integers t > 1, i G [m], j G [fc] let 

Jl if TT t -i(i,j) = 1 and7r t (i, j) = z t fl if T > t and 7r t (i, j) G {1, -Lf 

\ otherwise, tij \ otherwise. K ' 

Lemma 4.8 For any two sets I,Jc [8] x [to] x [k] we have 



E 



(t,i,j)ex (t,i,j)ej 



< (n-ey 111 (i - i/ n ) lJl 



Proof. Let I t = : (t,i,j) G X}, J t = : (t,i,j) G J}, X t = U i:U) ei t Htij T\( hMJt S tij . 

If X t = 1, theni < T (as otherwise Suj = by definition and Huj = because ir t = 7r t _i). Furthermore, 
X t = 1 implies that 

■K t -i{i,j) = 1 for all (ij) el t and ir t -x{i,j) G {-1,1} for all (ij) G J t . (15) 

Thus, let $ be a fc-SAT formula such that T [$] > t and 7r t _i [$] satisfies (Tl3T> . We claim that 

^[X t \T t _ x ]{^)<{n-d)-\^\{\-l/n)^. (16) 

To show this, we condition on the event 3? = t $. Then at time t steps PI1-PI2 select a variable z t from 
the the all-negative clause $^ t . As for each (i, j) G 2" t clause <&i contains a positive literal, we have 
</>t ^ i. Furthermore, we may assume that if (<frt,j) G i/t then j > fci, because otherwise X t = Stfaj = 
(cf. PI4). Hence, due to ( TT3T > and Proposition l4.2l in the conditional distribution P [-| J r t _i] ($) the variables 
(\&ij\)(i,j)£T t uj t aie uniformly distributed over V\Z t ^\ and mutually independent. Therefore, the events 
|3>jj | = z t occur independently with probability 1/| y \ | = l/(n — t + 1), whence 

E [Xtl^-i] ($) < (?i-i + l)- |It| (l-l/(?i-i + l)) |Jt| < (n-0) _|It| (l - l/n) |Jt| . 

This shows d!61 l. Finally, the assertion follows from Lemma l2~4l and (1161 1. □ Armed with Lemma l4~8l we 
can now bound the number of indices i G Ut such that $i has "few" positive literals. 
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A;(i) = w 



/-l 



(1 - t/n) 



fc-z 



Lemma 4.9 Wi'f/z probability 1 — o(l/n) the following is true for all 1 < I < one/ all 1 < t < 
min{T, 0}. Le/ 

'fc- 1\ /t 

There are at most (1 + e/9)A;(i)n indices i <E Ut such that <I?j /zas precisely I positive literals. 

Proof. Let Af C [to] be a set of size = [in 2 n\ and let Pj C [A;] be a set of size / — 1 for each i 6 M. 
Let V = (Pi)i£M be the family of all sets Pj. Furthermore, let tj : Pj — > [t] for all i € At\ and let 
T = (tj)ieAI comprise all maps i,;. Let £m{V, T) be the event that the following statements are true: 

a. &i has exactly I positive literals for all i G Af . 

b. $y = z ti (j) for alH e A1 and j G Pj. 

c. t < T and no variable from Z t occurs negatively in 
Moreover, let 

1 = l M (V,T) = {(s,i,j):i€M,j€P i ,s = t i (j)}, 
J = JM(V,'T)={(s,i,j):i€M,je[k)\P i } 

Let Yi = lif clause has exactly / positive literals, including the literals $y for j G Pi (i G Al). Then 
P [Yi = 1] = (fc — £ + l)2~ fc for each i G AL Moreover, the events Y'i = 1 are mutually independent and 
.Po-measurable. Therefore, by Lemma H~8l 



n • u s tij\fo 

(t,i,j)ei (t,i,j)ej 



P[^(P,T)] < E 




• E 












"A 


-l + l 


■ (n - 


< 




2 k 



(17) 



Let £m be the event that t < T and 3?; has exactly I positive literals and i G lit for all i G Af. If £At 
occurs, then there exist P,T such that £_m(V,T) occurs. Furthermore, for each i G Af there are (.J 
ways to choose a set Pj and then t 1 ^ 1 ways to choose the map £,;. Therefore, the union bound and (T% 
yield 

P [£ M ] < Yl P ^ ^' T )l - A " where 

& 



A = 



/^x^tl^n-^-'d-l/n)^ 1 )*. 



Hence, by Lemma [2721 with probability 1 — o(l/n) there are at most (1 + o(1))Ato indices i e [to] such 
that *&; has precisely I positive literals and i G Lit- Thus, the remaining task is to show that 



Am < (l+e/10)Ain. 

To show (fT8T l. we estimate 

A s * 2 - t -C:i 1 )(^) M(i - i/ ">' ( '" 1 " < '" 11) 



(18) 



< kT 



-1A /* 
We can bound r] as follows: 



Z-i 



(1 - ?;/n) fc ~ 1 ~ (i ~ 1) 7?, where 77 



n — t 



(1 - W 
1 - t/n 



(-19) 



< (l + t/(n-t)y 



exp(— t/n) 



k-l 



^exp(— t/n — (t/n) 2 ) 
< exp(2W/n + k(6/n) 2 ) < exp(8Zfc" 1 In a; + 16AT 1 In 2 uj) 



< (1 + 2t/n) exp(fc(i/n) 2 ) 
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Since I < \f~k~ and u < Ink, the last expression is less than 1 + e/10 for sufficiently large k. Hence, 
77 < 1 + e/10, and thus < fl~8T > follows from il9[ . □ The following lemma deals with i E U t such that <I>; 
contains "a lot" of positive literals. 

Lemma 4.10 W.h.p. the following is true for all I > Ink. There are at most nexp(— I) indices i G [to] 
such that €>; has exactly I positive literals among which at least I — 1 are in Zg. 

Proof. Let M C [to] be a set of size fi = [in 2 nl and let Pj C [k] be a set of size / — 1 for each i G M. 
Furthermore, let £j : Pi — ► [6*] for all i e AL and set T = Let £jn(P,T) be the event that the 

following two statements are true for alH G M : 

a. <I>; has exactly I positive literals. 

b. For all j G Pi we have ffr^ = z^y). 

Let Em be the event that for alH G M clause $i has exactly I positive literals among which I — 1 are in 
Zg. If £m occurs, then there are P, T such that the event Em (P, T) occurs. 

For i G .M we let Yi = 1 if clause <& ; has exactly I positive literals, including the literals <&y for j G Pj. 
SetX = {(«,», j) : i G M,i G P,s = If E M {V,T) occurs, then H, >( ... z 'tt.,, • ELeM X = 1- 

Bounding E [IlieA-i ^'J as m tne P ro °f °f Lemma |4~9"1 and applying Lemma l4~8l we obtain 



P[^(P,T)] < E 




•E 




< 


'jfc 








_(s,i,j)ez 







(n-0) 



Hence, by the union bound 

P [£m] < P PP,T : E M {V,T) occurs] < J^P [^(P,T)] < A M , where 



A 



k 

I - 1 



3 ;-i 



k-l + 1 



(n-6) 



i-i 



(20) 



Lemma [2721 implies that w.h.p. there are at most 2Ato indices i G [to] such that 3>i has exactly ^ positive 
literals of which Z — 1 lie in Zg. Thus, the estimate 



2Ato < 

< 
< 



2 fc+1 wn 
k 

2tun ■ 



k 

I - 1 
ekO 



k-l + 1 



(l-l)(n-6) 
nexp(— /) 



2 fc 
i-i 



< 2wn 



121nc 



i-i 



[as = 4nfc 1 lnw] 
[because i > In k] 



completes the proof. □ 
Proof of Lemma \4~5\ Since T < 9 w.h.p. by Corollary 14.41 it suffices to show that w.h.p. for all < t < 
min{T, 9} the bound \lit\ < (1 + e/3)um holds. Let Uu be the number of indices i G lit such that has 
precisely I positive literals. Then by Lemmas |4. 1 01 and 14.91 w.h.p. for all t < min{T, 9} and all 1 < I < k 
simultaneously 

nexp(— k) if I > s/k, 

(1 + e/9)Aj(i) otherwise. 



Therefore, w.h.p. 



max \Ut | 

O<t<min{T,0} 



< 



^^JAti < nfcexp(— k) + 



0<t<min{T. 



1=1 



<t<min{T,0} 



fe 

E 

l<l<Vk 



(l + e/9)A,(t) 



< n+(l + e/9)wn<(l+£/3)wn, 



as desired. 



□ 
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4.3 Proof of Corollary I£S1 



Define a map tp t ■ Ut — * V as follows. For i <E Ut let s be the least index such i <E U s \ if there is j such 
that <&,y £ V \ let ij>t(i) = ^iji an d otherwise let ipt(i) = z s . Thus, if i £ J7 S then is the unique 
positive literal of that does not belong to Z s . The following lemma shows that the (random) map tjj t is 
not too far from being "uniformly distributed". 

Lemma 4.11 Lett > 0, U t C [m], and^ t :U t ^V. Then P Vt = $t|W t = U t < (n - 
Proof. Set Z_i = 0. Moreover, define random variables 



n t (i,j) if 7r t (i,j) £ {-1,1} 
otherwise 



for £ [m] x [fc] 



Thus, j t is obtained by "forgetting" the literals ir t (i,j) £ VU7 that the process PI1-PI4 has revealed up 
to time t. Observe that for any i £ [m] 



i eU t max7o0, j) > A (Vj £ [fc] : 7t (z, j) = min{7o(i, j), 0}) 



(21) 



Fix a set Zij C [m], let <I> be any formula such that U t [$] = Uu and let j t = It [$] ■ For s < i let r s be 
the event that j u = j u for all u < s. The goal is to prove that 



Let t : Ut — ► [0, t] assign to each i £ U t the least s such that i £ U s . We claim that 

P [Vi £ tf t : ^t(») = &(*)|rt] < (n - r(z))" 1 . 



(22) 



(23) 



Since r(i) < i for all j £ W t , (EH implies (1221 . 

Let r s be the event that %l> u (i) = V't(*) f° r all < u < s and all i £ t~ 1 (u), and let r_i = £lfc(n, m). 
In order to prove d23l , we will show that for all < s < t 



P[r s |r s _inr 5 ] < ( n - s )-l T_1 WI 

p[r s |r s _inr 5 ] = p[T»|r._inr t ]. 



and 



(24) 
(25) 



Combining (1241 and ( f25T > yields 



ViGWt : it> t {i) = ^ t (i)\T t 



P[r t |r t ]= ' [ P[r s |r s _inr t ] 



[ PIt.It._x nr.] < [ (n-s) 



0<s<t 



0<s<t 



which shows (l23l . Thus, the remaining task is to establish d24l and d25l . 
To prove (1241 it suffices to show that 



p[T,nr,|y,-i] (y) 
p[t s _! nr s |^ s _!](^) 



< (n - s)~ |t 1(s)l for all p £ t s _i n r.. 



(26) 



Note that the l.h.s. is just the conditional probability of t s given t._i n T s with respect to the probability 
measure P [- |_ C7 s _i] (<p). Thus, let us condition on the event _• =._i f £ t._i n T s . Then <_ £ T s , and 
therefore 70 = 70 and 7. = 7s . Hence, (f2 fTT > entails W s = £4 [<p] = U s [$], and thus r _1 (s) C U s . Let 
i £ t _1 (s), and let J,; be the set of indices j £ [k] such that 7._i(i, j) = 1. Recall that ^.(i) is defined 
as follows: if <_jj = z s for all j £ J^, then ij) s {i) = z s ; otherwise tp s (i) = _'y for the (unique) j £ J, 
such that 7^ 2.. By Proposition l4.2l in the measure P [-l^-i] (y) the variables (3?ij)»eT - 1 («),jeJi are 
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independently uniformly distributed over V \ Z a -\ (because 7r s _i(i,j) = j s -i(i,j) = 1). Hence, the 
events i/j a (i) = i>t(i) occur independently for all i G t _1 (s). Thus, letting 

Pi = P[^(»)=V*(»)AVje J t : 7s (i,j) = 01^] (tp), 
q t = P^/j'S J i :i a (i ) j) = 0\f s - 1 }(<p) 

for i G r -1 (s), we have 

P[r s nr s |^_!] ( y ) n pi 

iGT i (s) 

Observe that the event Vj G J, : J s (i>j) = occurs iff <&jj = z s for at least |Jj| — 1 elements j G Jj 
(cf. PI4). Therefore, 

qt = \j t \-\v\z s ^\-^- 1 \i~\v\z s - l \~ 1 ) + \v\z s _ l \-\^ 

To bound pi for i G r _1 (s) we consider three cases. 

Case 1: ?AtM As <J>.y e \ Z s _i for all j G J; the event %j) a {i) = i>t{i) nas probability 0. 

Case 2: V>t(i) = z s . The event = V't(i) occurs iff <J?.y = z s for all j G Ji, which happens with 

probability \V \ Z s _i|~l J *l in the measure P [-^s-i] (</?)■ Hence, p., = (n — s + l)~ |Ji| . 

Case 3: 4>t{i) G V \ Z s . If "0s W = $t(*)> then there is j G J, such that = V>t(i) and $< 3 v = z s for 
all / G J.\{j}. Hence, Pl = |J,| • |V\ Z._i|-l J 'l = | Ji|(n - s + 1)H J < 

In all three cases we have 

ft UKn-s+i^-i^iq-i/^-s + i)) 

Pi - |Ji|(n-* + l)-W " S ' 

Thus, (EU) follows from (f2Tb . 

In order to prove dZSb we will show that 

p[r a |r fc nr c ] =P[r a |r c ] (28) 

for any < b < c < a. This implies d25l l as follows: 

P[T s nr t ] p[r t |r s nr s ]p[r s nr s ] 



P[r s |r s _inr t ] 



p [r s _i n r t ] p [r t |r s _! n r s ] p [ Ta _i n r s 
» P[Tsnrs] -PWv.nr,]. 



p [r,_i n r s ] 

To show (l28l l it suffices to consider the case a = c + 1, because for a > c + 1 we have 

p[r a |r b nr c ] = p[r a |r b nr c+1 ]P[r fc nr c+1 |r b nr c ] 
= p[r Q |r b nr c+ i]P[r c+ i|T b nr c ]. 

Thus, suppose that a = c + 1. At time a = c + 1 PI1 selects an index <j> a G [m]. This is the least index 
i such that "y c (hj) = — 1 f° r a H i; thus, a is determined once we condition on T c . Then, PI2 selects a 
variable z a = \^cj, a j a | with j < k\. Now, 7 a is obtained from j c by setting the entries for some (i, j) such 
that 7c(i,i) G { — 1, 1} to (cf. PI4). More precisely, we have 7a(^ a ,i) = for all j < fci. Furthermore, 
for i G [m] \ {<j> a } let J7i be the set of all j G [k] such that ir a (i,j) = 7a(hj) € { — 1, 1}, and for i = <fr a 
let J7j be the set of all fci < j < k such that Tr a (i,j) = 7 a (i,j) G { — 1, 1}- Then for any i G [m] and 
any j G J7i the event J c (i,j) = only depends on the events | = z a for j' G Ji. By Proposition [472 
the variables (\$ij> \)ie[m],jeJi aie independently uniformly distributed over V \ Z c . Therefore, the events 
\&ij> I = z a for j' G Ji are independent of the choice of z a and of the event r b . □ 
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Proof of Corollar\ \4.6\ Let /i < (1 + e/3)ijjn be a positive integer and let U t C [m] be a set of size \i. 
Suppose that t < 6. Let v = nk~ s / 2 , and let B be the set of all maps ip : U t — * M such that there are less 
than i/ + 1 numbers a; G [n] such that t/j^ 1 (x) = 0. Furthermore, let Bt be the event that there are less than 
v variables x £ V \ Z t such that Ut (x) = 0. Since \Z t \ = t, we have 



Bt\U t =Ut] < X) P [^*=V'|Wt 



n — i 



51 



[by Lemma |4~TTI 



M Ipl I ol 

< ■ exp(26^/n) < l -J- • exp(9nfc" 1 In 2 k). (29) 



Furthermore, \B\/n tl is just the probability that there are less than v empty bins if \i balls are thrown 
uniformly and independently into n bins. Hence, we can use Lemma |2~T1 to bound \B\n~ tl . To this end, 
observe that because we are assuming e < 0.1 the bound 



2e e 2 

holds, where a = > 0.6e. 

3 3 ~ 



\B\n~^ < P [Z{n,n) < exp(-/i/n)n/2] 

< 0(\/n) exp [— exp(— n/n)n/8] < exp [-k^n/S] . 



exp(-/i/n) > exp(-(l + e/3)w) = fc Q_1 
Therefore, Lemma [2~T1 entails that 



(30) 



Combining (1291 and 



P* = P 



we see that 



B t \U t = U t :U t C [m] , \U t \ = \x < exp [nfc" 1 (9 In 2 k - k a /9)] = o(l/n). 
Thus, Corollary 14.41 and Lemma |431 imply that 

P [3t < T : \{x G V \ Z t : U t {x) = 0} < 



< P[T > 



max IW+I > (1 + e/3)ujn 

0<t<T 



0<t<8 



as desired. 



□ 



5 Proof of Proposition 133 



Let < e < 0.1. Throughout this section we assume that k > ko for a large enough fco = fco(£)> an d 
that n > no for some large enough uq = no(e, k). Let m = [_(1 — e)2 k k^ 1 lnfcj, uj = (1 — e) Ink, and 
k\ = \k/2~\. In addition, we keep the notation introduced in Section |4~T1 



5.1 Outline 

Similarly as in Section [4] we will describe the execution of Phase 2 of Fix(<&) via a stochastic process. 
Recall that T denotes the time when the process PI1-PI4 from Section|4](i.e., Phase 1) stops. Let Z' Q = 
and 7r = ttt- Let U' = Ut, and let U' {x) be the number of indices i G U' n such that x occurs positively 
in Moreover, let Q' be the set of indices i G [to] such that <& ; is unsatisfied under az T - For t > 1 we 
proceed as follows. 

PI1' If Q' t _! = 0, the process stops. Otherwise let ip t = min(5j_i. 

PI2' If there are three indices hi < j < k - 5 such that Tr' t _ 1 (tp t , j) G {1, -1} and E/ t '_i(|$Vt.j'l) = 0' 
then let fci < ji < j2 < J3 < fc— 5 be the lexicographically first sequence of such indices. Otherwise 
let fc — 5 < ji < j2 < J3 < k be the lexicographically first sequence of indices k — 5 < j < k such 
that §^ Let Z t ' = Zf_! U | : i = 1, 2, 3}. 
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PI3' Let U[ be the set of all i G [m] that satisfy the following condition. There is exactly one Z G [fc] such 
that $,,el/ \ U Z T ) and for all j ^ / we have * y G Z T U Z[ U V"\ Z T . Let U[(x) be the 
number of indices i G ?7/ such that x occurs positively in 4?; (x G V). 

PI4' Let 

< = f *y if (i = i> t A ] > fci) V G Z t ' U Z T V (i G C/ 4 ' A 7T (*,i) = 1), 
" 1 7Tj_ 1 (i,j) otherwise. 

Let Qj be the set of all (Zt, Z' t ) -endangered clauses that contain less than three variables from Z' t . 

Let T' be the stopping time of this process. For t > T' and x G V let 7r£ = 7r^,, £/ t ' = f/^,,, Z' t = Z' T ,, and 
L/ t '(x) = U T .{x) . 

We define an equivalence relation =' t by letting $ =J <3> iff $ = s ^ for all s > 0, and 7r£ [$] = 7Tg [\&] 
for all < s < t. Let T[ be the er-algebra generated by the equivalence classes of =' t . Then {T' t )t>o is a 
filtration. 

Fact 5.1 For any t > the map ir'p the random variable il>[ + \, the random sets U[ and Z[, and the random 
variables Uj.(x) for x G V are ^-[-measurable. 

The same argument that we used to prove Proposition l4.2l in Section l4Tl shows the following. 

Proposition 5.2 Let £' t be the set of all pairs such that 7Tt(i,j) G {±1}. The conditional joint 
distribution of the variables (|$y \)ti,j)eSt given T' t is uniform over (V \ Z' t ) *. 

Let 

6' = Lcxp(-fc £/16 )nJ, and recall that 9 = \Ank~ 1 lnwj . 

To prove Proposition [33] it is sufficient to show that T' < 8' w.h.p., because \Z' t \ = 3t for all t < T' . To 
this end, we follow a similar program as in Section |4T1 we will show that \U' t \ is "small" w.h.p. for all 
t < 6', and that therefore for t < 6' there are plenty of variables x such that Uj.(x) = 0. This implies 
that for t < 8' the process will only "generate" very few (Zt, Z' t ) -endangered clauses. This then entails a 
bound on T', because each step of the process removes (at least) one (Zt, Z' t ) -endangered clause from the 
set Q' t . In Section fOl we will infer the following bound on \U' t \. 

Lemma 5.3 W.h.p. for all t < 6' we have \U' t \ U T \ < n/k. 

Corollary 5.4 W.h.p. the following is true for all t < 9': there are at least nk e ^~ x variables x G V \ 
{Z' t U Z T ) such that U[(x) = 0. 

Proof. By Corollary 14.61 there are at least nfc £ / 2_1 variables x G V \ Zt such that Ut(x) = w.h.p. 
Furthermore, by Lemma 1531 we have \U' t \ Ut\ < n/k w.h.p. Moreover, \Z' t \ < 3t. Hence, w.h.p. the 
number ofx€V \ (Z' t U Z T ) such that U' t (x) = is at least nk 6 ^- 1 - n/k -39' > nk e / 3 ~ l . □ 

Corollary 5.5 Let y be the set of all t < 9' such that there are less than 3 indices k\ < j < k — 5 such 
thatix't-ityuj) 6 {-1, 1} andU't_ x (\$^ tj \) = 0. Then \y\ < 38' exp(-fc £ / 4 ) w.h.p. 

We defer the proof of Corollary 15. 5l to Section l531 Furthermore, in Section l5~4l we will prove the following. 

Corollary 5.6 W.h.p. the total number of (Zt, Z' e , ^-endangered clauses is at most 8' . 

Proof of Proposition ^. 3\ We claim that T' < 8' w.h.p.; this implies the proposition because \Zt'\ = 3T'. 
To see that T' < 9' w.h.p., let Xq be the total number of (Zt, Z^,)-endangered clauses, and let X t be the 
number of (Zt, Zg,)-endangered clauses that contain less than 3 variables from Z' t . Then the construction 
PI1'-PI4' ensures that < X t < X - t for all t < T . Hence, V < X , and thus the assertion follows 
from Corollary 1331 □ 
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5.2 Proof of Lemma 15.31 

Let Huj , Suj be as in (fl4l i and let in addition 

K / = fl if* , t _ 1 (i,j) = l,* , t (i,j)eZ , t ,wiT<0, 
t% 3 1 otherwise. 

Lemma 5.7 For any X' c [<?'] x [m] x [fc] w wave E II( t ,;.j) e z' Kijl^o < ( 3 /( n - - 30'))' 1 ' 1 . 
Proof. LetXj = {(i,j) : (t,i,j) E X'} and X t = Y[(j j)cz' ^'tij- ^ ue t0 Lemma l2~4l it suffices to show 

E [X t \F' t _^\ < (3/(n - - 3e»')) |X *' for all f < 0'. (31) 

To see this, let 1 < t < 6' and consider a formula $ such that T[$] < 6,t < T [$], and 7r£_x(i, j) [$] = 1 
for all G 2|. We condition on the event $ =' t _ 1 $. Then at time t steps PI1'-PI2' obtain Z t ' by 
adding three variables that occur in clause 3ty t , which is i?t_ ^-endangered. Let (i, j) G 2|, Since 
$ =t_i $ and 7r t _i(i, j) [$] = 1, the literal ^ Z T U is positive, and thus <&, is not (Z T ,Z[_^)- 
endangered. Hence, ip t ^ i. Furthermore, by Proposition l5.2l in the conditional distribution P M^t-x] (^) 
the variables (3?j?)(j,j)ez' ^ independently uniformly distributed over the set V \ (Zt U Z' t _ x ). Hence, 

P G ^|^_x] [*] = 3/|V \ (Zr U for any (i, j) G T[, (32) 

and these events are mutually independent. Since \Zt\ = n — TandT < 0, and because \Z' t _ l \ = 3(t — 1), 
(1321 implies (ED- □ 

Lemma 5.8 Let 2 < I < Vk, 1 < I' < I - 1, 1 < * < 0, and 1 < t' < 0'. For eacn i G [m] /ef Xi = lif 
T > t, T' > if, and the following four events occur: 

a. &i has exactly I positive literals. 

b. V of the positive literals of <&i lie in Z[, \ Z t . 

c. I — V — 1 of the positive literals of ^i lie in Z t . 

d. No variable from Z t occurs in 3>i negatively. 

Let 



„3'k\ 1 ' fk-V-l\ /A w ' _1 , 
■'>\ 



k-l 



Then P [YZ i Xi > B(l, I', t)] = o(n- 3 ). 

Proof. We are going to apply Lemma [2~2l Set fi = [In 2 n\ and let M. C [m] be a set of size /i. Let 5m be 
the event that Xj = 1 for all i G .M. Let Pj C [fc] be a set of size i, and let Hi, H[ C P, be disjoint sets 
such that |Fj U P-| = I - 1 and |P|| = V for each i G M. Let P = (P, P.;, H[) if z M . Furthermore, let 
U : Hi — > [i] and i< : P? — > [if] for all i G M, and set T = (U, if^i^M- Let (P, T) be the event that 
T > t,T' > if , and the following statements are true for all i G A4: 

a'. The literal $y is positive for all j G P, and negative for all j G [fc] \ P;. 

b'. $ij G \ Z' t , {i) _ x for all i G M and j G P|. 

c'. &ij = Zt t (j) for all i G and j G P;. 

d'. No variable from Zt occurs negatively in 



5 PROOF OF PROPOSITION ?? 



22 



If £m occurs, then there exist (V,T) such that £_m(V,T) occurs. Hence, we are going to use the union 
bound. For each i 6 [M] there are 



ways to choose the sets Pj, Hi, H[. 



t' 1 ways to choose the map tf., and t l 1 1 ways to choose the map ij. 



y l,l',l- V - 1 
Once these are chosen, there are 



Thus, 



P[£m] < ^2P[S M (V t T)]< 



1,1', I- I' - 1 



maxP [Sm(P,T)]. (33) 



Hence, we need to bound P [Em (V,T)] for any given V , T. To this end, let 

1 = l(M,V,T) = {(s,i,j):ieM,j€P i ,s = t i (j)}, 
J' = l'(M,V,T) = {(s,i,j):i€M,j€P;,s = t' i (j)}, 
J = J(M,P,T) = {(s,i,j):iGM,j £{k]\(P t UP[),s<t}. 

If £m{P i'P) occurs, then the positive literals of each clause <&i, i <G Ai, are precisely <&jj with j G Pi, 
which occurs with probability 2~ k independently. In addition, we have 7i s ij = 1 f° r a H (s,i,j) € Z, 
TL'sij = 1 for all (s, i, j) £ I' , and S S ij = 1 for all (s, i,j) <E J . Hence, by Lemmas l4.8l and [5TTI 



P[£ M (V,T)} < 2- fc "-E 



(t.ij)ei' (t,i,j)ei {t,i,j)ej 



< 2 



n-9-39 

Combining (l33l and d34l i. we see that P < A M , where 



(n — 



(1 - l/n) (fc -° tM . 



(34) 



A = 2 



-ft 



l,l',l-V - lj \n-9-39' 
whence Lemma l2~2l yields P E"Li Xi > 2Am] = o(n~ 3 ). Thus, the remaining task is to estimate Am: 



Am = mk2 k 



.fk-l 
V 



3f 



V 



k-V -I 



t 



< cun ■ 



69' k 



71-9-39' J \l - V - I) \ n 



l-l'-X 



(1 - l/n) (k - 1 ^ 



where 



(35) 



l—l—X / / A A i \f\ k—l 

1 -t/n 

i-l'-i 



< 1 + expffci 2 /™ 2 ) < exp(26l/n + k9 2 In 2 ). 

V n-ej 

Since 6* < 4fc~ 1 rt In and 2 < vfc, we have 77 < 2 for large fc. Thus, the assertion follows from d35l l. □ 

Lemma 5.9 Let In fc < I < fc, 1 < 2' < I, 1 < £ < 9, and 1 < f < 9'. For each i e [m] let Y % = 1 if 
T > t, T' > t', and the following three events occur: 



a. #i /ifli exactly I positive literals. 
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b. V of the positive literals of <&i lie in Z' t , \ Z t . 

c. I — V — 1 of the positive literals of <&i lie in Z t . 
Then P [J2T=i Y i > nexp(-l)] = o(n~ 3 ). 

Proof. The proof is similar to (and less involved than) the proof of Lemma \57)\ Set \i = [In 2 n\ and let 
M. C [to] be a set of size p. Let Em be the event that Yi = 1 for all i e [M], Let Pj C [k] be a set of 
size I, and let H t , H[ C Pi be disjoint sets such that |Pj U H^ = I — 1 and |P,-| = /' for each ieM. 
Let P = (P h Hi, H<) ieM . Furthermore, let U : Hi -> [i] and i< : P^ -> [f] for all i G M, and set 
T = (ti , t 9 i£M ■ Let £>i (P, T) be the event that T > t,T' > t', and the following statements are true for 
all i e M: 

a'. &ij is positive for all j G Pj and negative for all j ^ P^. 
b'. G Z' t , U) \ Z' t , {i) _ x for aU % G M and j G H[. 
c'. $y = z t .(j} for all i G M. and j G P;. 
If Em occurs, then there are (V,T) such that Em (P, T) occurs. Using the union bound as in (1331 . we get 



V,T 



1,1' I- I' - 1 



maxP[S M (V,T)]. (36) 



Hence, we need to bound P [Em (P, T)\ for any given P, T. To this end, let 

I = l(M,P,T) = {(s,i,j)-.ieM,j ePi,s = ti(j)}, 

I' = l'(M,P,T) = {(s,i,j):i€M,j€P' i ,s = ti(jy}. 

If £m(P, T) occurs, then the positive literals of each clause <J>; are precisely with j G P (i 6 M). In 
addition, H' si j = 1 for all (s, i, j) G P and 7i' ai j = 1 for all (s, i, j) G P. Hence, by Lemmas |4. 81 and 15^71 



P[£ M (V,T)] < 2- k »E 



(Mj)ex' (M,j)ex 



< 



Combining d36l l and d37l i, we see that P [£jvi] < A M , where 



A = 2- k 

< vr k 

< vr k 



1,1',1-V - 1 

k - 1 
I' 



31? 



3f 



n-9 -39 
i 



n — 



36" 



i-i'-i 



i-i'-i' 



ii 

07) 



n-6-36' 
Gkd'V fe(k-l' - 1) 



(I -I'- l)n 



fc-r - 1 
/ - /' - 1 
i-i'-i 



(38) 



Invoking Lemma [2~2l we obtain P ElUi > 2Am] = o(n 3 ). Thus, we just need to show that 2Am < 
exp(— l)n. Since 9/n < Ak" 1 lnw and < k~' 2 , in the case /' > 1/2, (fJUl yields 

Am < um (4elnw • 0'/nf /2 < exp(-l)n/2. 
Furthermore, if I' < 1/2, then we obtain from I 



Am < wnexp(— 21') (lOelnw/Z) 
Hence, in either case we obtain the desired bound. 



l-l'-i 



< exp(-l)n/2. 



□ 
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Proof of Lemma \5J\ Let X(l, I', t, t') be the number of indices i 6 [to] such that <J>; satisfies a.-d. from 
Lemma EH if t < T and t' < T' , and set X(l, /', t, H) = if t > T or i' > T. Let 5 be the event that 
T < 9 and X(l, I', t, t') < B(l, I', t) for all 2 < I < y/k, 1 < I' < I - 1, t < 9, and f < 0'. Then by 
Corollary I4.4l and Lemma EH 

P [-.£] < P [T > 0] + fc00' • o(n~ 3 ) = o(l). (39) 

Let be the number of indices i £ Ut> \ Ut and <I>, has precisely Z < \/& positive literals. If i has these 
properties, then i satisfies the condition a.-d. from Lemma [5~8l for t = T and some 1 < V < I. Therefore, 



\U t >\U T \<J2li. (40) 



i=i 

If the event £ occurs, we have 

E 7 < ^ E E^^-^EE^ 1 ' 7 ) 

l<Z<\/fc l<;<VE i/=1 ' =1 '' = 1 



< 



n 



4wnV ( ) < 5wn < n/fc 2 [because 0' < n/fc 4 ]. (41) 

' \ n I n 



v=i 



Furthermore, by Corollary 14. 41 and Lemma [5T9l we have 

^2 II - E cx P(-0" < n / k2 wh P- C 42 ) 

v / fe<i<fc Vk<l<k 

Thus, the assertion follows from (I39l- (l42l i. □ 

5.3 Proof of Corollary 15.51 

As a preparation we need to estimate the number of clauses that have contain a huge number of literals 
from Z t for some t < 9. 

Lemma 5.10 Let t < 9. With probability at least 1 — o(l/n) there are no more than ncxp(— k) indices 
i G [m] such that \{j : ki < j < k, G Z t }\ > k/4. 

Proof. For any i E [to] , j £ [k] , and 1 < s < 9 let 



7 ■■ — 



1 if = z s , 7r s _i(i,i) G {-l,l},ands < T, 
otherwise. 



Then for any setX C [t] x [to] x ([k]\ [ki]) we have 

< (n-0)- |x| . (43) 



E 



To see this, let T s = {(«', j) : (s, i, j) G 1} and set Z s = JT^ -Z S ij- Then for all s < 9 the random 
variable Z s is ^-measurable by Fact 14. II Moreover, we claim that 



V[Z s \F s -i]<{n-9)-\ x °\ (44) 
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for any s < 9. To prove this, consider any formula $ such that s < T [$] and 7r s _i(i, j) [$] 6 {—1, 1} 
for all (i, j) £ T s . Then by Proposition 14.21 in the probability distribution P [• |JF S _ 1 ] ($) the variables 
($ij)(i,j)ex s are mutually independent and uniformly distributed over V\Z S _\. They are also independent 
of the variable z s , because j > fci for all £ T s and the variable z s is determined by the first fci literals 
of some clause cf> s (cf. PI2). Therefore, for all £ T s the event = z s occurs with probability 

1/\V \ Z s -i\ independently. As |Z s -i| = s — 1, this shows (04), and (T43b follows from Lemma [2741 
and <f44T >. 

Let = lif t < T and there are at least n — |~fc/4] indices j £ [fc] \ [fci] such that £ Z t , and 
set Xi = otherwise. Let M. C [m] be a set of size ji = [In 2 n] and let £^4 be the event that X; = 1 for 
all i <E .M. Furthermore, let P^ C [fc] \ [fci] be a set of size k — 1 for each i € M, and let t j : P, — » [t] be a 
map. Let P = (Pi)igjvi and T = (^)i eJ n, and let £m (P, T) be the event that t < T and Zt^ij = 1 for 
a\\i £ M and all j £ P l . Let 

I = I M (V,T) = {(U(j),i,j) :i£M,j£ Pi}. 

Then d43b entails that for any P, T 



P[£ M (P,T)]<E 



< (n - 6>) HX| < (n - fl)-^"" 1 ). (45) 



Moreover, if £jvt occurs, then there exist P, T such that £m (V, T) occurs. Hence, by the union bound 

p[£m] < Y, p ^ ( v > r )l ^ A " where 



Finally, Lemma 12721 implies that with probability 1 — o(n~ 1 ) we have 

m 

< 2mA < n- 2 k (129/n) K - 1 < nexp(-k), 

as desired. □ 
Proof of Corollar\ \5.5\ We use a similar argument as in the proof of Corollary 14.71 Let 

Ul = \{x€V\(Z T UZ' t ):Ul(x)=0}\, 

set a = e/3, and define 0/1 random variables B' t for t > 1 by letting B' t = 1 iff the following statements 
hold: 

a. T" > t. 

b. W t '_j > nk ' 1 . 

c. There are less than fc/4 indices fci < j < fc such that |4ty ti3 -| G ^t- 

d. There is z £ Z' t \ Z[_ x such that U[_ x (z) > 0. 

This random variable is .^'-measurable by Fact l5.ll Let 6 = cxp(— fc Q /6). We claim 

E [B' t \T t -i] < S for any t > 1. (46) 

To see this, let $ be a formula for which a.-c. hold. We condition on the event <& = t _\ Then at time 
t the process PI1'-PI4' chooses ijj t such that $^, t contains less than three variables from Z' t _ 1 . Since 
$ satisfies c, there are less than fc/4 indices j > fci such that |3?,/> t j| £ Zt- Further, since <fr,/, t is 
(Zt, -endangered, there is no j such that ir^^ipt, j) = 1. Consequently, there are at least |fc — 
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ki — 6 > fc/5 indices fci < j < k — 5 such that ir' t _ 1 (ip t , j) = —1. Let be the set of all these indices. 

Then Proposition l5~2l entails that in the distribution P [ - 1 J?-^ x ] (^) the variables (\§>^ t j are mutually 

independent and uniformly distributed over V \ (Zt U Z' t _^). Therefore, the number of indices j G J such 
that U'^Q^jl) = has a binomial distribution Bin(|J|, |W t '_ 1 |/|V r \ (Z T U Z' t _{)\). If d. occurs, then 
there are less than three indices j 6 J such that C/ t '_ 1 (|<E > ^ t3 ;|) = 0. Since \ J\ > fc/5, b. and the Chernoff 
bound (HJ yield 

E [BJI^J (*) < P[Bin(\J\,\Ul_ l \/\V\(Z T UZ' t _ 1 )\)<3] 
< P [Bin ( [fc/5], fc"" 1 ) < 3] < S 

(provided that fc is sufficiently large). Thus, we have established (06}. 
Let y' = \{t € [0'] : = 1}|. We are going to show that 

y' < 29' 5 w.h.p. (47) 

To this end, letting /i = [In n] , we will show that 

fi-i 

E [(y' )„] < (6'5r where = J] y j. (48) 

j=o 

This implies (@7). For if y > 20'<5, then for large n we have > (26'5-nY > (1.9 ■ 6' Sy, whence 

Markov's inequality entails P [y > 20'<5] < P [(y) M > (1.96' dy] < 1.9"^ = o(l). 

In order to establish (|48T >. we define a random variable y~ for any tuple T = (ii, . . . , < M ) of mutually 
distinct integers t\, . . . , G [0]' by letting y' T = nf=i ■ Since (y ) M equals the number of /i-tuples T 
such that y- = 1, we obtain 

E [(y'U < Y, E [3>t] < ^ m ax E [y' T \ . (49) 
r 

To bound the last expression, we may assume that T is such that t\ < ■ ■ ■ < t M . As B' t is JF t ' -measurable, 
we have for all Z < /i 



E 



< E 



E 



= E 



ED 

< <5-E 



i-i 



Proceeding inductively from Z = /Lt down to Z = 1, we obtain E [y T ] < <5 M , and thus ( f4~8b follows from d49l . 

To complete the proof, let y" he the number of indices i € [to] such that |<&y | E Zt for at least fc/4 
indices ki < j < k. Combining Corollary 14. 41 (which shows that \Zt\ — T < 8 w.h.p.) with Lemma B.lOl 
we see that y" < n cxp(-fc) < 95 w.h.p. As |y < y + y", the assertion thus follows from (07]i. □ 



5.4 Proof of Corollary 15.61 

Recall that a clause f>j is (Zt, Z' t ) -endangered if for any j such that the literal $y is true under oz T the 
underlying variable lies in Z' t . Let 3^ be the set from Corollary 15.51 and let Z = [J se y Z s \ Z a -\. 
We claim that if $i is (Zt, ^ t ')-endangered, then one of the following statements is true: 

a. There are two indices 1 < j\ < j 2 < k such that \ = \*&ij 2 \- 

b. There are indices i' ^ i, ji ^ j 2 , j[ ^ j' 2 such that | = |*^-/ 1 and |*y 2 1 = |$^|. 

c. $i is unsatisfied under az T ■ 

d. <fr; contains more than k = [vfcj positive literals, all of which lie in Z[ U Zt- 

e. 4>i has at most k positive literals, is satisfied under az T , and contains a variable from Z. 
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To see this, assume that is (Zt, Z' t ) -endangered for some t < T' and a.-d. do not hold. Also observe 
that Z D Zt ft Z' t by construction (cf. PI2'); hence, if there is an index j such that $y = x for some 
x G Zt, then x G Z, and thus e. holds. Thus, assume that no variable from Zt occurs negatively in 4?^. 
Then <&i contains Z > 1 positive literals from V \ Zt, and we may assume without loss of generality that 
these are just the first I literals &n , <&u . Furthermore, . . . , <f>u G Z[. Hence, for each 1 < j < I 
there is 1 < tj < t such that $y G Z' t . \ Z' t ._ x . Since <I>; satisfies neither a. nor b., the numbers t\,...,ti 
are mutually distinct. (For if, say, t\ — t2, then either €>;i = &i2, or 4^ and &^ tl have at least two 
variables in common.) Thus, we may assume without loss of generality that t\ < ■ ■ ■ < ti. Then i G U' t t 
by the construction in step PI3', and thus 3>u G Z. Hence, e. holds. 

Let X ai . . . , X e be the numbers of indices i G [to] for which a.,. . . ,e. above hold. W.h.p. X a + 
Xb = 0(\nn) by Lemma 1231 Furthermore, X c < exp(— k E / 8 )n w.h.p. by Proposition 13.21 Moreover, 
Lemmas |4. 101 and 15.91 yield Xd < 2exp(— n/2)n w.h.p. Finally, since y < W exp(— k E / 4 ) w.h.p. by 
Corollary [53] and as \Z\ = 3\y\, Lemma l2~6l shows that w.h.p. 

X e < JO 1 -9exp(-fc £ / 4 )n < 0'/2. 
Combining these estimates, we obtain X a + ■ ■ ■ + X e < 6' w.h.p. 



6 Proof of Proposition 1X4 



As before, we let < e < 0.1, and we assume that k > fco for a large enough fco = ko(e), and that 
n > no for some large enough uq = no(e,k). Furthermore, we let m = [(1 — e)2 k k^ 1 lnfcj, u = 
(1 — e) In k and ki = \k/2] . We keep the notation introduced in Section 14.11 In particular, recall that 
9 = [Ank^ 1 lnwj. 

In order to prove that the graph G(<&, Z, Z') has a matching that covers all (Z, Z') -endangered clauses, 
we are going to apply the marriage theorem. Basically we are going to argue as follows. Let Y C Z' be 
a set of variables. Since Z' is "small" by Proposition 13. 31 Y is small, too. Furthermore, Phase 2 ensures 
that any (Z, Z') -endangered clause contains three variables from Z' . To apply the marriage theorem, we 
thus need to show that w.h.p. for any Y C Z' the number of (Z, Z') -endangered clauses that contain 
only variables from Y U (V \ Z') (i.e., the set of all (Z, Z') -endangered clauses whose neighborhood in 
G(*, Z, Z') is a subset of Y) is at most \Y\. 

To establish this, we will use a first moment argument (over sets Y). This argument does actually not 
take into account that Y C Z', but it works for any "small" set Y C V. Thus, let Y C V be a set of size 
yn. We define a family (yij)ie[m],je[k] °f random variables by letting 



Vij 



1 if|*y|GF, 
otherwise. 



Moreover, define for each integer t > an equivalence relation = J on f2fc(n, m) by letting $ =J $' iff 
tt s [<£>] = 7r s [$'] for all < s < t and yij [$] = y^- [$'] for all (i , j) G [m] x [k] . This is a refinement of the 
equivalence relation = t from Section |47T~I Let T\' be the cr-algebra generated by the equivalence classes 
of =1 . Then the family (J r Y)t>o is a filtration. Since contains the cr-algebra Tt from Section |4~T1 all 

t -measurable are 



random variables that are Tt -measurable are Tj -measurable as well. 



Proposition 6.1 Let £j be the set of all pairs such that TT t (i,j) G {1,-1} and yij = 0. The 
conditional joint distribution of the variables j)££ Y given is uniform over (V \ (Z t U Y)) £ * . 

Proof. Let [$]^ be the =^-class of a formula $. Then P$ = P [-l^ 7 ^] ($) is just the uniform distribution 
over [$]^. Let V\ '($) be the set of all pairs G [m] x k such that |$y| G Y and n t (i,j) [$] G 

{ — 1,1}. We will actually prove the following stronger statement: with respect to the measure P$ the joint 
distribution of the variables (\&ij\)(ij) e £YuvJ i s uniform over (V\(Z t UY)) £ t x {Y\Z t ) v K 
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To show this, we use a similar argument as in the proof of Proposition 14.21 For any two maps / 
£j ($) -»V\(yuZ ( ($)) and 3 : Vj ($) — * Y \ Z t ($) we define a formula 



(*/, 9 )i 



1. 



f(i,j) if 6 £ t ($)and7r (i,j) 
/(»,j) if G and 7r (i,j) = 1, 

if (», j) G 2? t (*) and7r (i, j) = -1, 
g(i,j) if G 2? t (*) and7r (i,j) = 1, 



otherwise. 



Then $y g =^ Therefore, the map 



(V\(Z t UY)f? x (Y\Z t ) 2 



[*]*, (f,g) » */, fl 



is bijection. 

For any t > 1, i G [to], j G [k] we define a 0/1 random variable 7~L]fij by letting 
t < T, 7t t -i(i,j) = 1 and7r t (i, j) 



1 if Vii 



□ 
0, 



27. 



Lemma 6.2 For any sef J E\ [9] x [to] x [fc] we have E JT^ i .j gI \Fq < (n — 0) I 1 '. 



Proof. Due to Proposition 16.11 the proof of Lemma l4T8l carries over directly. □ 

For a given set Y we would like to bound the number of i G [m] such that 3>i contains at least three 
variables from Y and has no positive literal in V \ (Y U Z-r). If for any "small" set Y the number of 
such clauses is less than |Y|, then we can apply this result to Y = Z' and use the marriage theorem to show 
that G($, Z, Z') has the desired matching. We proceed in several steps. 

Lemma 6.3 Let t < 9, let M. C [to] be a set of size fi, and let L, A be maps that assign a subset of [k] to 
each i G M. such that 

L(i) n A(i) = and | A(i)| > 3 for all i G M. (50) 
Let £(Y, t,A4,L, A) be the event that the following statements are true for all i G Ai: 
a. G Y for all j G A(i). 

/?. 3>jj /s a negative literal for all j E [fc] \ U A(i)). 

c. $y G Z t \ Y for all j G 

^ i = E l& M \ L d)\ and A = EieM |A(i)|. ^en P [£(Y, t,M,L, A)] < 2- fc "(2t/n)'(2y)\ 

Proo/ Let £ = £(Y, t,A4,L, A). Let ti be a map L(i) — ► [i] for each i G A4, let T = (i^eM, an d let 
£ (T) be the event that a. and b. hold and $y = ^(j) for alii E M and j G L(i). If £ occurs, then there 
is T such that £(T) occurs. Hence, by the union bound 



P [£} < P \- £ ( T )l ^ ** ™f xP t £ ( T ) 



(51) 



To bound the last term fix any T. Let I = {(s, i, j) : i E A4,j G £(£), s = *z (j ) }■ If £(X) occurs, then 



1 for all (s, i, j) G X. Therefore, by Lemma Rx2 



p[£(T)\rZ] < e 



n 



< (n 



-m 



(n - 0)~ 



(52) 



Furthermore, the event that a. and b. hold for all i E M is JF^-measurable. Since the literals are 
chosen independently, we have 



P [a. and b. hold for all i E M] < y x 2 x - k ^ = (2y) X 2"^ 



(53) 



6 PROOF OF PROPOSITION ?? 



29 



Combining g2) and (ED, we obtain P [£ (T)] < 2- fe ''(n - 0)"' (2y) A . Finally, plugging this bound 
into (IBTl i, we get 



P [f ] < 2"*" (2y) A < 2"*" (|) (2yY 



as desired. □ 

Corollary 6.4 Let t < 9, and let M C V have size \M\ = /t. Let l,Xbe integers such that A > 3/i. 
Let £(Y,t,A4, 1, A) be the event that there exist maps L, A that satisfy ( 1501 ) such that I = ^2 ieM \L(i)\, 
A = ^2 ieM |A(i)|, and the event £(Y,t,M, L,A) occurs. Then 

P[£(Y,t,M,l,X)] <2- l - kll (2k 2 y) x . 

Proof. Given I, A there are at most (^) ways to choose the maps L, A (because the clauses in M. contain 
a total number of kfj, literals). Therefore, by Lemma 1631 and the union bound 

2^P[8(Y,t,M,lA)] < (^)(2f/»)W^^^ 

= 2~ l (2ky) x • w -50M-«ina whereas l - . (54) 

50/i In a; 

Since -a In a < 1/2, we obtain ^-BO/i-aW < ^-25^ < (i n fc) 25 ^ < fc A . Plugging this last estimate 
into (l54l yields the desired bound. □ 



Corollary 6.5 Lef t < 9 and let £(t) be the event that there are sets Y C V, A4 C [m] of size 3 < 
|Y| = = \i < nk~~ 12 and integers I > 0, A > 3/i suc/i f/zaf the event £(Y,t,A4, 1, A) occurs. Then 
P [f (t)] = o(l/n). 

Proof. Let us fix an integer 1 < /i < nk~ 12 and let £(t,fi) be the event that there exist sets Y, .M of 
the given size /i = j/n and numbers I, A such that £(Y,t,M,l,X) occurs. Then the union bound and 
Corollary l64lvield 

p [£(*,/*)] < 2 E E p [^ i -^ 1 ' A )H(!)(j 2 "' , W' i 



< 



e 2 2 fc lnc^ M 



2 2 - k »(2k 2 y) 3 » < 4 [yfc 6 f < y-^ 2 . 



ky 2 

Summing over 3 < fx < nk- 12 , we obtain P [£(*)] < E M p [£(*> /■»)] = 0(n~ 3/2 )- □ 
Proof of Proposition \3.4\ Assume that the graph G($, Z, Z') does not have a matching that covers all 
(Z, Z') -endangered clauses. Then by the marriage theorem there are a set Y C Z' and a set M. of (Z, Z')- 
endangered clauses such that \M\ = |Y| > and all neighbors of indices i G A4 in the graph G(G>, Z, Z') 
lie in Y. Indeed, as each (Z, Z') -endangered clause contains at least three variables from Z' ', we have 
|Y| > 3. Therefore, for each clause i £ M the following three statements are true: 

a. There is a set A(z) C [k] of size at least 3 such that |«I>^ | £ Y for all j 6 A(i). 

b. There is a (possibly empty) set c [fc] \ A(i) such that #^ S Z for all j £ £(«)■ 

c. For all j e [fc] \ U A(i)) the literal <I>jj is negative. 
As a consequence, at least one of the following events occurs: 

1. T> 8 = L4fc- 1 ln^J. 

2. |Z'| > nk~ 12 . 

3. There is t < 9 such that £(t) occurs. 

The probability of the first event is o(l) by Proposition 13.21 the second event has probability o(l) by 
Proposition l3.3l and the probability of the third event is 9 ■ o(n _1 ) = o(\) by Corollary 16. 5 1 □ 
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